A Multi-modal Deep Neural Network approach to Bird-song Identication

نویسندگان

  • Botond Fazekas
  • Alexander Schindler
  • Thomas Lidy
چکیده

We present a multi-modal Deep Neural Network (DNN) approach for bird song identification. The presented approach takes both audio samples and metadata as input. The audio is fed into a Convolutional Neural Network (CNN) using four convolutional layers. The additionally provided metadata is processed using fully connected layers. The flattened convolutional layers and the fully connected layer of the metadata are joined and fed into a fully connected layer. The resulting architecture achieved 2., 3. and 4. rank in the BirdCLEF2017 task in various training configurations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Convolutional Neural Networks for Large-Scale Bird Song Classification in Noisy Environment

This paper describes a convolutional neural network based deep learning approach for bird song classification that was used in an audio record-based bird identification challenge, called BirdCLEF 2016. The training and test set contained about 24k and 8.5k recordings, belonging to 999 bird species. The recorded waveforms were very diverse in terms of length and content. We converted the wavefor...

متن کامل

Experimental and finite-element free vibration analysis and artificial neural network based on multi-crack diagnosis of non-uniform cross-section beam

Crack identification is a very important issue in mechanical systems, because it is a damage that if develops may cause catastrophic failure. In the first part of this research, modal analysis of a multi-cracked variable cross-section beam is done using finite element method. Then, the obtained results are validated usingthe results of experimental modal analysis tests. In the next part, a nove...

متن کامل

Simplified Stochastic Feedforward Neural Networks

It has been believed that stochastic feedforward neural networks (SFNNs) have several advantages beyond deterministic deep neural networks (DNNs): they have more expressive power allowing multi-modal mappings and regularize better due to their stochastic nature. However, training large-scale SFNN is notoriously harder. In this paper, we aim at developing efficient training methods for SFNN, in ...

متن کامل

Making Stochastic Neural Networks from Deterministic Ones

It has been believed that stochastic feedforward neural networks (SFNN) have several advantages beyond deterministic deep neural networks (DNN): they have more expressive power allowing multi-modal mappings and regularize better due to their stochastic nature. However, training SFNN is notoriously harder. In this paper, we aim at developing efficient training methods for large-scale SFNN, in pa...

متن کامل

Multi-modal Deep Learning Approach for Flood Detection

In this paper we propose a multi-modal deep learning approach to detect floods in social media posts. Social media posts normally contain somemetadata and/or visual information, therefore in order to detect the floods we use this information. The model is based on a Convolutional Neural Network which extracts the visual features and a bidirectional Long Short-Term Memory network to extract the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017